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STABLE INTEGRANDS 

The present invention relates to transposable elements and methods for their 
incorporation into the genome. 

Transposable element technology has enabled the genetic transformation of a wide range 
of insects [c.f Handler, A. (2001), Insect Biochem Mol. Biol. 57, 111-128; Handler, A. (2002), 
Insect Biochem. Mol. Biol. 52, 1211-1220; Horn, et aL, (2002), Insect Biochem. Mol. Biol. 52, 
1221-1235], and at least some of these transposable elements have also been shown to be mobile 
over a much wider phylogenetic range, including prokaryotes and vertebrates (Handler, 2001, 
supra). 

Transposons have been described extensively in the prior art. The elements used for 
genetic transformation of insects are generally characterised by opposing inverted repeat 
sequences and are associated with an optionally integral transposase enzyme. The transposase 
recognises the inverted repeat sequence and excises the sequences, together with any intervening 
DNA, and then reinserts the resulting transposon at another site, either in the genome or in a 
plasmid. 

An autonomous transposon encodes its own transposase; for a type II transposon this is in 
the DNA between the inverted repeat sequences. Such autonomous transposons are not stably 
incorporated into the genome, as they are liable to move at random. Accordingly, where such a 
transposon is to be used to incorporate target DNA into a genome, it is preferred to provide the 
transposase on a helper plasmid, for example, so that transposition is dependent on the 
availability of the helper plasmid. 

This system for incorporating genes or nucleic acid sequences into genomes is effective, 
but the transposed DNA is still associated with the inverted repeat sequences of the transposon, 
so that any future exposure to transposase can result in movement of the transposon, possibly 
even across the species barrier, although this unlikely. 

The presence, or potential presence, of transposases capable of remobilising these 
insertions has led to objections to the use of this technology, particularly when the transformed 
organisms are for release into the environment. In addition, the presence of such transposases 
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will tend to destabilise the insertions, which is generally undesirable. Though laboratory strains 
can be screened for the presence of related transposases by any of several methods, based, for 
example, on nucleic acid sequence similarity, such as by hybridisation or PCR, or function, such 
as plasmid-to-plasmid transposition assays or plasmid excision assays, it is not possible 
exhaustively to test all wild populations. 

It is therefore desirable to develop transformation methods which lead to a transgenic line 
which is insensitive to transposase, or in which the insertion is insensitive to transposase. 

In order to stabilise the genetic insert, it is possible to provide three or more repeats, and 
locate the insert between two repeats in the same orientation. It follows that the number of 
transposable elements is equivalent to the number of repeats in one orientation multiplied by the 
number of repeats having the opposite orientation. Exposing the construct to transposase will 
result in all of the possible transposable elements being obtained, including the entire length of 
the transposon containing the gene of interest, which can then be inserted into the genome. The 
resulting insertion can then lose the shorter length which does not contain the insert by further, or 
the same, transposase activity. This shorter length is a transposable element flanked by inverted 
repeats. Once removed, it leaves the desired genetic insert in place, but with only one repeat i.e. 
no longer within a functional transposable element, thereby disabling the ability of transposase to 
excise the genetic insert. This process is subject to extremely low success rates, however, as the 
transposase will usually generate the shortest length transposable element. 

Thus, transposable elements with repeated ends can potentially utilise any of their 
repeated ends for transposition or excision. It is therefore possible to obtain transposition using a 
first distal end, followed by excision using another, more proximal end, leaving a fragment 
behind. This scheme is illustrated in Figure 1. 

In Figure 1, triangles A-C represent the functional ends of the transposon, for example, 
the short inverted repeats at the ends of a class II element such as piggyBac. Transposition can 
occur between any two opposed triangles: in this case, A-B and A-C. Flanking transposon DNA 
may also be included, see below. An appropriate target site sequence, for example, TTAA or 
( A/T)N( A/T)TTAA( A/T)N( A/T) i s a Iso a ssumed t o b e included i n t hese e xamples. P roximal 
and distal are relative to the 5' end. The construct of Figure 1A is introduced into cells or 
embryos, for example, by microinjection, transfection, or ballistic or other methods, with a 
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suitable transposase helper, for example, helper plasmid, RNA, transposase protein or integrated 
transposase source, also known as a jumpstarter. 

A transformant containing the integrated product must then be selected, as shown in 
Figure IB. 

The selected transformant is then exposed to a suitable source of active transposase, for 
example, helper plasmid, RNA, transposase protein or an integrated transposase source, to 
generate the specific excision product shown in Figure 1C. The resulting product is substantially 
or completely stable to further exposure to active transposase, relative to the starting construct. 

In the above e xample, t wo i ntegration e vents are p ossible i n t he i nitial transformation 
step, then two excisions are possible once the desired integrant has been obtained. The 
alternative integration utilises the proximal 3* end, resulting in the insertion of a smaller 
transposon than desired, lacking the DNA of interest; the alternative excision utilises the distal 3* 
end, resulting in the complete excision of the construct and hence excision of the DNA of 
interest. Insect transformation is an inefficient process, so it is desirable to optimise the 
efficiency of this step. With suitable markers, even inefficient or rare excision may be 
acceptable, as an integrated transposase source can be used, so that it is possible to score large 
numbers of insects without requiring microinjection. 

It is, therefore, highly desirable to maximise the efficiency of the first step, so as to 
provide the greatest possible number of initial transformants obtained by microinjection, for 
example, and obtain as many full-length insertions as possible and to maximise the ratio of full- 
length to short insertions. 

The ,f 3-end H system described above and illustrated in Fig.l has another weakness: the 
presence in the final product of a transposon end adjacent to the DNA of interest. In principle, 
insertion of another identical or cross-mobilising transposable element nearby could reconstitute 
a composite transposon and thereby destabilise the DNA of interest. There is, therefore, a need 
for a method to generate insertions in which no transposon ends remain. 

Another available option for insect transformation involves adding another mobile 
element near or at one end of the transposon and rely on imprecise excision of this element to 
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stabilise the insertion by deleting one end. This is an inherently unsatisfactory method as the 
nature of the deletion is unpredictable and uncontrollable. 

Another method for generating transgenics without flanking transposon DNA is well 
known in Drosophila, though not for any other insect. This is the homologous recombination 
system of Rong and Golic (Rong, Y., and Golic, K. (2000), Science 288, 2013-2018; Rong, Y., 
and Golic, K. (2001), Genetics J 57, 1307-1312). Essentially, the contents of a transposon insert 
are moved to another, predetermined chromosomal location. The new insertion need not have 
any transposon DNA, though it does have to have a target site for a site-specific recombinase, 
such as Flp/FRT. As originally described, this insertion is always associated with a large direct 
repeat. The main limitation of this scheme is the highly variable, and generally very low, rate at 
which the new homology-based insertions are recovered. It also requires several enzyme 
activities to be available, such as by prior establishment of transgenic lines, for example FLP and 
I-Scel. The main use of this system is to generate knock-outs or other targeted modifications to 
specific genes or sequences. 



Various attempts in the art have been made to improve the effectiveness of transposition. 
WO 02/46444 discloses a transposable polynucleotide suitable for use in methods for 
manipulating nucleic acids to create libraries of cells containing transposed nucleic acid, wherein 
the transposable polynucleotide comprises two or more inverted repeat sequence p airs. Each 
pair has a distinct and separable ability to interact with a distinct transposase enzyme. The pairs 
can, for instance, be provided in a nested fashion such that both members of one pair are flanked 
by both members of the second pair. For instance, the Tn5 transposon, described therein, 
includes "inside end" sequences and "outside end" sequences. A transposase is disclosed that 
preferentially binds and interacts with the outside ends but not the inside ends. 

WO 01/91802 discloses a chimeric nucleic acid vector comprising adenoviral inverted 
terminal repeat flanking regions comprising, between the inverted terminal repeats, retroviral 
long terminal repeat flanking regions, within which is further found a cassette comprising the 
DNA of interest and a gag nucleic acid region. Such vectors are capable of transducing all cis 
and trans components of a retroviral vector for the generation of a high tighter recombinant 
retroviral vector for use in in vivo gene transfer applications. 
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Wobus et al, Molecular and General Genetics, 1990, vol. 222, pp 311-316 describes a 
new transposable element from Chironomas thummi which is shown to have inverted repeats of 
17 base pairs at each termini. The transposable element is called TECthl, is 1.7kb long and was 
found in the 3' flaking region of a C. thummi Balbiani ring gene. 

US 2002/0173634 sets out to solve problems in the use of the piggyBac vector caused by 
lack of suitable restriction sites to cut the components needed for gene transfer, and limitations 
on the sizes of genes transferred by use of this vector. This is achieved by removing large 
portions of apparently non-essential DNA from the piggyBac transformation vector. 

US 6,200,800 Bl discloses retroviral vectors containing cw-acting viral elements from 
the expression, encapsidation, reverse transcription and integration of the retroviral genome 
nucleic acid sequence. A retroviral vector is provided for eliminating most of the viral elements 
which are not useful in the integrated provirus. The vector provided use, among other thing, the 
bacteriophage PI Cre-lox recombination system. 

Russ et al, J. Virology, 1996, vol. 70, pp. 4927-4932 discloses self-deleting retrovirus 
vectors for use in gene therapy. Russ et al discloses how retrovirus vectors containing a loxP 
site fuse to independently expressed selectable marker generates proviruses flanked by loxP. 
This enables the Cre recombinase to excise most of the provirus apart from the marker gene. 

Steiner S. et al, Genetics, 1995, vol. 140, pp. 973-987 shows that homologous 
recombination is the main mechanism for DNA integration and the cause of rearrangements in 
the filamentous ascomycete Ashbya gossypii. 

What is required is a simple system with enhanced rates of initial transformation with a 
desired gene or nucleotide sequence. 

Thus, according to a first aspect, the present invention provides a transposable element 
comprising at least four inverted repeats, forming at least two pairs of opposing pairs of inverted 
repeats, the element comprising DNA for insertion into a host genome, the DNA being located 
between two pairs of opposing repeats such that excision by a transposase or transposases of said 
pairs, in situ, is effective to be able to leave said DNA integrated into the host genome, without 
the presence of flanking transposon-derived repeats derived from said transposable element. 
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The DNA for insertion into a host genome is preferably a gene that is to be expressed in 
the host, although it is envisaged that it could preferably also be a genetic element that does not 
encode a protein, such as regulatory element, for instance a promoter or enhancer sequence. 

Alternatively, it is also preferred that the DNA for insertion into a host genome is simply 
a DNA sequence for insertion into a target sequence for the purposes of, for instance, altering the 
frame of a coding sequence, disrupting the function of a gene or even deleting a portion of DNA 
if, for instance, the transposition occurs between target sites some distance apart. 

A gene's function may be disrupted, for instance, by insertion of stop codon into the 
coding sequence thereof, insertion of suitable nucleotides sufficient to bring about a stop codon 
where one was not previously found in frame, or insertion of "junk" DNA into the coding 
sequence or untranslated regions, thereby disrupting the gene's promoter or enhancer, for 
instance. 

Preferably, the inverted repeats are "piggyBac" repeats. 

It is also preferred, the number of inverted repeats is 6, 7, 8 or more, but more preferably 
5 and most preferably 4, as transposases generally favour shorter sequences to transpose or 
excise, thus making it preferable to have as few inverted repeats as possible to minimise 
unwanted transpositions and excisions and to promote the intended transpositions and excisions. 
This is so as to, for instance, leave only the gene-of-interest remaining in the host genome, such 
that as much as possible of the genetic material derived from the transposon, with the exception 
of the gene-of-interest, is removed. 

Whilst it is preferred to remove only some of the transposon-derived DNA from the host 
genome, such as only one excisable transposon of opposing inverted repeats, it is more 
preferable to excise all transposon-derived DNA from the host genome, apart from the gene or 
DNA of interest. 

The inverted repeats, also referred to as "ends" in the art, are preferably homologous, 
such that the repeat sequence is the same, albeit inverted. This has the advantage of requiring 
only one transposase. Alternatively, it is also preferred that the different pairs of homologous 
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repeats are heterologous, such that they do not form opposing inverted repeats which may be 
transposable or excisable. Preferably, the heterologous repeats are recognised by different 
transposases, thereby allowing the user to exert control over different transposition and excision 
steps. 

The transposase is preferably autonomous, such that the transposase is encoded within 
the transposon itself However, it is also preferred that the transposase is provided by another 
means, such as a further genetic element comprising the transposase, for instance another 
transposon or preferably a helper plasmid, thereby also allowing the user a greater level of 
control as to when the various stages of the transposition and excision according to the present 
invention occur. 

The transformant is preferably exposed to a suitable source of active transposase, for 
example, a helper plasmid or RNA encoding the transposase, or a transposase protein or 
integrated transposase source, as known in the art. 

Preferably, one or more of the inverted repeats is a minimal repeat, as discussed below. 
In this instance, it is preferred that the minimal repeat is non-terminal, i.e. it is an internal repeat, 
as discussed below. 

It is also preferred that markers are associated with the transposon in order to allow the 
user to follow the progress of the various steps of transposition and excision and to determine in 
which individuals have been said steps have successful. Suitable markers and systems therefor 
are also discussed below. Preferably at least one genetic marker is associated with an 
identifiable step in the transposition/excision process, and more preferably, the marker is 
associated with the DNA for insertion into a host genome. 

Preferably, t he D N A for i nsertion i nto a h ost g enome d oes n ot c omprise a ny i nverted 
repeats or target sites for insertion of the present invention, which could lead to unwanted 
excision of part or all of a previous insertion, or to the transposable element of the present 
invention being transposed into a previous insertion. Although it may be impractical to have 
absolutely no inverted repeats or insertion sequences in the DNA for insertion into a host 
genome, it is preferred that these be kept to a minimum or that they are heterologous to, and 
therefore not recognised by, the transposase or transposases used by the present invention. 
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Opposing inverted repeats are capable of recognition by a suitable transposase, leading to 
transposition or excision of the repeat and intervening DNA. If the inverted repeats are not 
opposing, i.e. they are orientated in such a way that the second repeat sequence is not a 
complementary mirror image of the first repeat sequence, then no transposition or excision will 
occur between these repeats. This can be seen in Figure 1A, where repeats A and B are 
opposing, thereby allowing transposition/excision therebetween, whereas repeats B and C are not 
opposing. It can also be seen in Figure 3A, where the two repeats marked 5' are not opposing, 
nor is the left hand 5* repeat in relation to the left hand 3' repeat. In the diagrammatic 
representations used in Figures 1-4, an opposing pair of repeats is shown by a 5 f arrow and a 3 1 
pointing in opposite directions. 

The opposing inverted repeats of the present invention are preferably excised in situ, 
from the host genome, following initial transposition of the full-length element, comprising the 
at least four repeats and the DNA of interest, into the host genome. 

Preferably, the element comprises two external opposed inverted repeats, one on each 
side of an inversion cassette, the cassette comprising; 

the DNA for insertion into a host genome, two inverted cassette repeats and two 
inversion sites, the DNA for insertion into a host genome being flanked on either side by one of 
the inverted cassette repeats, each inverted cassette repeat being further flanked by an inversion 
site; 

the cassette being capable of inversion within the transposed element in situ in the 
presence of a recombinase, such that following inversion, the two inverted cassette repeats 
flanking the DNA for insertion into a host genome each separately form a further pair of 
opposing inverted repeats with one of the external inverted repeats, the further pairs of opposing 
repeats being excisable by a transposase in situ to leave said DNA without flanking transposon- 
derived repeats in the host genome. 

It will, of course, be appreciated that the repeats or inversion sites that, for instance, are 
said to be flanking or bounding another feature, are not necessarily intended to be directly 
adjacent thereto, but may be proximal or have a spacer therebetween, provided that this does not 
impair the function of either feature. In some cases, it is envisaged that such a spacer may be 
useful and may even include a marker, for instance. 
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The external inverter repeats are found on each side of the inversion cassette. Before 
inversion, the external opposing repeats form a pair of opposing inverted repeats transposable by 
a transposase, such that the repeats and the cassette comprised therebetween are transposed into 
the host genome by the transposase. Following inversion of the cassette in the host genome by a 
recombinase, each of the external repeats then forms a new excisable element or transposoh with 
an opposing inverted repeat that is now opposing as a result of the inversion. Two further 
newly-formed opposing pairs of inverted repeats are found, one on each side of the inverted 
cassette and are excisable. Being shorter than the pair formed between the two external repeats 
found at either end of the transposed and inverted element and comprising the DNA or gene of 
interest, the newly-formed repeats are favoured over the longer repeat and are excisable in 
preference thereto, leaving only the DNA of interest, preferably with minimal flanking 
sequences, in the host genome. 

A transposon-derived repeat is a repeat that originated in the transposable element and 
not from the host genome, or is homologous to, complementary to or a variant of that original 
repeat. 

The inversion sites are sequences recognised by inversion-inducing recombinase. 
Prefereably, such sites comprise the Flp or Cre sites from the Flp/FRT or Cre/lox inversion 
systems, or any other such systems known in the art. Preferably, the inversion sites are 
recognised by a directional recombinase, the recombinase-mediated inversion being essentially 
irreversible. Preferred sites of this type are, for instance, lox66 or lox71. 

The inversion cassette preferably comprises the DNA of interest, the two inverted 
cassette repeats which later go on to form one half of the opposing repeats that are used to excise 
the transposon-derived DNA, and the inversion sites, such as the FRT site of a Flp/FRT system 
or a lox site from a Cre/lox system, which direct the recombinase to the inversion of the 
intervening DNA. Preferably, the cassette may also comprise markers, as discussed below. 

The present invention also provides a method of transposition comprising use of the 
transposable element to insert DNA of interest into a host genome, together with the 
transformant or organisms produced by said method. 
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In one aspect the present invention preferably provides a transposable element 
comprising at least four inverted repeats, at least two of which are inverted in relation to the 
others, comprising DNA for insertion into a host genome located between two pairs of opposing 
repeats excisable by a transposase in situ to leave said DNA without flanking transposon-derived 
repeats in the host genome. 

Inverted repeats are sequences which are found in identical (but inverted) forms. In other 
words, the repeats have the same sequence running in the 5' to 3' direction, except that these 
sequences are on different strands of the DNA. For example, 5'-CCCTAG-3' and S'-CTAGGG- 
3' are inverted repeats as the second sequence is the mirror image of the first, except that the 
mirror image has also been converted into the antisense of the first, such that A has been 
converted to T, C to G and so forth. Thus, the complementary sequence of the second repeat 
reads 3'-GATCCC-5' (5'-CCCTAG-3')> the same of the first repeat, only on the complementary 
strand. 

The inverted repeat sequences of piggyBac transposons, for example, are associated with 
short, external TTAA sequences, which generally correspond to the insertion site. It is, 
therefore, preferred that such sequences are provided with the terminal inverted repeats of the 
present invention. 

Figure 3 illustrates a scheme involving two pairs of inverted repeats flanking the gene or 
DNA of interest, and how this may be used to generate an insert ultimately associated with no 
transposon repeats. 

The construct of Figure 3 A is injected with a suitable transposase helper, for example, 
helper plasmid, RNA, transposase protein or integrated transposase source. 

A transformant containing an integrated product as shown in Figure 3B may then be 
selected, and exposed to a suitable source of active transposase, for example, helper plasmid, 
RNA, transposase protein or integrated transposase source. 

Transformants containing either of the specific excision products shown in Figure 3C 
may then be selected. Further exposure to transposase may then be employed to obtain the final 
excision product shown in Figure 3D. 
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As mentioned above, the inverted repeat sequences of piggyBac transposons, for 
example, are associated with short, external TTAA sequences, which generally correspond to the 
insertion site. The inverted repeats are also associated with internal stretches of DNA. These 
internal stretches of DNA can be deleted to leave a minimal repeat, so that the repeats simply 
flank the desired DNA to be inserted. Transposons created in this manner are capable of 
excision and plasmid-to-plasmid transposition, and can do so with frequencies similar to those of 
constructs containing more internal piggyBac sequence (Elick et al., 1997, supra). However, 
such minimal piggyBac elements are not capable of efficient germline transformation, giving 
transposition frequencies approximately 20x lower than more complete elements. 

It has also recently been established that transposable activity is retained if part of the 
repeat is deleted, but that the resulting insert can no longer be excised by transposase activity at 
the truncated repeat. 

It has now been found that it is possible to provide a transposon with three or more 
inverted repeats of which a middle repeat is a minimal repeat, such a transposon providing 
significantly enhanced levels of initial transformation. 

Thus, in a further aspect, the present invention provides a transposable element 
comprising at least three inverted repeats, at least one of which is inverted in relation to the 
others, wherein at least one non-terminal repeat is a minimal repeat. 

Preferably, the element comprises DNA for insertion into a host genome located between 
the minimal repeat and a repeat having the same orientation as the minimal repeat. 

The DNA for insertion into a host genome is preferably flanked by two pairs of opposing 
repeats excisable by a transposase in situ to leave said DNA without flanking repeats in the host 
genome, although it preferred that each of the repeats bounding the DNA for insertion into a host 
genome is a minimal repeat. 

It is preferred that at least one repeat distal to the DNA for insertion into a host genome 
in relation to a minimal repeat in the same orientation has an internal deletion or is otherwise 
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compromised over up to 50% of its length, thereby reducing the frequency of excision by a 
transposase at that repeat. 

Preferably, at least one genetic marker associated with an identifiable step in the 
transposition/excision process and more preferably, the marker is associated with the DNA for 
insertion into a host genome. 

It w ill b e a ppreciated t he greatest 1 ength o f a t ransposon i s t he d istance b etween, a nd 
including, the two opposing repeats farthest from each other. Other, shorter transposons can then 
be defined within the length of the longest transposon by the presence of other repeats. It is at 
least one of these non-terminal, or internal, repeats that are preferably minimal repeats. 

As used herein, the term 4 minimal repeat' applies to the highly conserved repeat 
sequences observed to be required for the activity of a given transposase. The piggyBac 
transposon, for example, has a 32bp terminal inverted repeat interrupted by a 4bp insertion at the 
5 f end and a 31bp insertion at the 3 1 end. This can be considered as two pairs of inverted repeats, 
one of 13bp and another of 19bp, these two being separated by 4bp at the 5' end and 31 bp at the 
3' end. The minimal repeat, in this instance, then comprises a 32bp repeat with a 4 or 31 bp 
insertion at the 5 f and 3* ends respectively. 

It is generally preferred to provide the flanking sequences commonly associated with the 
terminal repeats. In the case of piggyBac, this is preferably TTAA. 

In general, transposases will be more effective at cutting out shorter sequences so that, 
where a transposon has one 5* repeat and two 3' repeats, for example, the most common 
transposon that will be observed transferring to another locus will be the shorter, formed by the 
5' repeat together with the more proximal of the two 3* repeats. This preference is both 
inevitable, owing to the fact that the longer transposon will still be cut by the transposase, as well 
as being exacerbated by topological considerations, whereby repeats in closer proximity appear 
to be bound substantially better by transposases. 

The present invention overcomes this problem by employing at least one minimal repeat 
internal to the longest transposon of the construct. This minimal repeat is utilised at only very 
low rates for the original insertion, thereby strongly biasing the initial reaction to generating the 
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larger transposon. Any DNA of interest is located between the minimal repeat and a 
corresponding full length repeat in the same orientation. 

WO 0/0441504 discloses the use of a 3 end (3 inverted approach), but there is no mention 
of the use of minimal ends. There is a discussion of the use of inversion, similar to that used in 
conjunction with the transposable element comprising at least 4 inverted repeats according to the 
first aspect of the present invention. However, WO 0/0441504 only discloses the use of 
inversion in relation to the 3 end approach and not in relation to a 4 end approach. 

It will be appreciated that minimal repeats may simply comprise the minimum repeat 
necessary to effect transposition, and be associated with none of the original intervening DNA 
between the repeats observed in piggyBac, for example. However, the present invention 
envisages using some of the internal sequence, although it is preferred to keep this to a 
minimum, as greater lengths will increase the transposition frequency of the minimal repeat, 
thereby d iminishing t he d esirable b ias d iscussed a bove. A ccordingly, w hile i t i s p referred t o 
keep internal sequence associated with the minimal repeat to zero nucleotides, it is possible to 
use up to 100 bases, for example, of the original sequence, but it is preferred to use 50, or less, 
and preferably 1 0 or less. 

Correspondingly, for the full length sequences, there is no clearly established limit as to 
what constitutes full length. It is established, however, that between about 6 and 14% of the 
internal sequence of a naturally occurring transposon is sufficient to provide high levels of 
transposition frequency, so that the present invention generally prefers that 'full length repeat 
sequences' b e a ssociated w ith at 1 east 5 % o ft he o riginal i ntemal s equence o f the transposon 
from which they are derived, with between 6% and 14% being preferred, and 8% to 12% being 
more preferred. 

The transposons of the present invention may employ two or more minimal repeats, 
although two is the preferred maximum. Where two are used, then it is preferred that these both 
be internal and in opposite orientations. Where two smaller transposons flank the DNA of 
interest, this has the particular advantage of encouraging the full length transposable element to 
be incorporated into the genome and then losing the two smaller transposons, thereby leaving 
simply the DNA of interest without any flanking repeats. This method is also provided herein, 
simply using full length repeats, as defined above. 
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It is preferred to use a minimal piggyBac end as the more proximal of the distal ends and 
a fully-functional piggyBac end as the distal end. This arrangement strongly biases the initial 
transformation step towards insertion of the desired longer transposon. 

In an alternative embodiment, multiple copies of the 3' and/or 5' ends is provided, with 
minimal repeats preferably providing the inmost of any series of repeats in the same orientation. 
This increases the chance that one or more longer versions of the transposon are integrated in the 
initial transformation, as required. One version of this is illustrated diagrammatically in Figure 
2. 

The construct of Figure 2A is introduced into cells or embryos, for example, by 
microinjection, transfection, or ballistic or other methods, with a suitable transposase helper, for 
example, helper plasmid, RNA, transposase protein or an integrated transposase source. 

A transformant containing the integrated product of Figure 2B is then selected, and 
exposed to a suitable source of active transposase, for example, helper plasmid, RNA, 
transposase protein or integrated transposase source. Transformants containing the desired 
specific excision product, as shown in Figure 2C, can then be selected. 

While efficiency in the second, excision step is less critical, as large numbers of 
individuals can readily be screened, it is preferred that the excision preferentially removes the 
short transposon, rather than a long one, as frequently as possible. The present invention 
provides such a system. piggyBac elements with suitable deletions or mutations in their internal 
inverted repeat are competent for transposition but not for excision, or have reduced excision 
rates. Use of such a modified end as the distal end will therefore bias the excision reaction 
towards utilisation of the proximal end as a higher proportion of the total of excision events. 

Thus, it is preferred to use, as a terminal repeat, a repeat having a deletion of no more 
than 50%, or mutation or inversion that disables no more than 50% of the repeat. It is preferred 
that such a compromised repeat be in the same orientation as the minimal repeat, where there is 
only one. These repeats are readily transposed, but are not readily excised after transposition, 
thereby biasing the excision reaction towards the minimal repeat. Where more than one minimal 
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repeat is used, in opposing orientations, then more than one compromised repeat may also be 
used. 

The compromised repeat is generally preferred to correspond to a full length repeat, other 
than in respect of the deletion or mutation. 

In relation to either aspect of the present invention, it is possible that subsequent exposure 
to transposase will simply lead to excision of the entire sequence, in which case the organism is 
effectively unaltered from the original. The alternative provides the DNA of interest in 
combination with one full length repeat, but no corresponding inverted repeat, so that the insert 
is no longer part of a transposable element. 

It will also be appreciated that the sequences with which the repeats are associated need 
not correspond completely to the original sequences found in the naturally occurring 
transposons, and that variation and sequence degeneracy are encompassed within the scope of 
the present invention. In particular, it is preferred that any original sequence associated with the 
repeats of the present invention have at least 70% homology with the corresponding natural 
sequence, more preferably at least 80%, more preferably at least 90%, and particularly 95% or 
above, especially 100%. 

In order to ensure that the resulting organism has been transformed in the manner desired, 
it is preferred to utilise appropriate markers. These may be used in any manner suitable to 
inform the skilled person as to the status of the transformant For example, markers may be 
associated with the DNA of interest in order to demonstrate that the organism has successfully 
been transformed. Markers may be incorporated in the areas between repeats that are to be 
deleted, so that the initial transformant containing the full length sequence can be detected. Any 
suitable combination of markers may also be used. 

It is generally preferred that markers be selectable, either positive or negative, and 
suitable examples are illustrated hereinunder. 

The use of any transposable element is envisaged, but class II elements, such as Hermes, 
hobo, Minos, and mariner, are preferred, owing to their relatively high fidelity during 
transposition, and the piggyBac element, which is known to use the distal element of a repeated 
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pair at relatively high frequency [Elick, et al., (1997), Mol. Gen. Genet. 255, 605-610], is 
particularly preferred. 

Suitable transposable elements will be known to the skilled person and although 
reference is made to piggy Bac elements in several examples of the present invention, it will be 
appreciated that the present invention is not limited thereto and that such reference was made 
purely for the purpose of exemplification of he invention. 

The present invention also provides a method of creating a new marker, preferably by a 
rearrangement such as that discussed below. Furthermore, the present invention also extends to 
a method of removing a conditional lethal. If one or both of the flanking transposons contained a 
conditional lethal gene, one could use this as a negative selection for the original element, and 
thereby select the desired excision product, even if quite rare. This is equivalent to the use of 
visible markers, but simpler and requires less effort, so that more transformants can be screened. 

The present invention has been shown to work in insects as these are useful and well 
understood models for genetic transposition. However, it will be appreciated that the present 
invention will also be useful in a whole range of organisms for transposition into a DNA 
genome. Accordingly, the present invention is useful in mammals, but also in plants, fungi, and 
even prokaryotes and viruses. 

The present invention, therefore, also provides a method for transforming an organism, 
comprising exposing replicative cells or tissue of the organism to an element according to the 
present invention under conditions effective to incorporate the element into the genome thereof 
and, subsequently or simultaneously therewith, providing conditions suitable to excise a 
transposon from the genome, and selecting an organism, or cells or tissue therefor, comprising 
the DNA intended for insertion lacking repeats in at least one orientation. Preferably, the 
organism is a mammal, a plant, a fungus, a prokaryote, such as bacteria or a virus. 

The present invention also encompasses an organism obtained in accordance with the 
transposable element and method discussed herein. Preferably the organism is an insect. 

The final transgenic line preferably has no transposon DNA whatsoever associated with 
the insertion, unless this has been deliberately incorporated into the DNA of interest. Since there 
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is no known upper or lower limit for the length of transposition-competent piggyBac elements 
that would constrain the length of the DNA of interest, the present invention provides a method 
of very general utility for inserting DNA into genomes of cells or organisms. However, we 
generally prefer that the total length of the initial piggyBac transposon be in the range 3-25kb, as 
this corresponds approximately to the range of piggyBac transposons commonly used for 
germline transformation in insects. 

There is no lower limit to the amount of DNA that can be inserted by the overall 
procedure, after the flanking transposons have been excised. The initial insertion will retain the 
target site specificity of the original element, such as TTAA for piggyBac, with some apparent 
preference for (A/T)N(A/T)TTAA(A/T)N(A/T), which may also be written as 
WNWTTAAWNW (SEQ ID NO. 20) where "W" denotes A or T. Precise excision of the 
elements will resolve this to a duplication of the TTAA, flanking the DNA of interest, which can 
be as short as a single nucleotide. In the event that zero nucleotides are inserted, only the TTAA 
duplication remains. The insertion of larger fragments is generally preferred. 

A suitable example of a small insertion is a stop codon. Insertional mutagenesis using 
transposable elements is a well known method for genetic screens of various types. However, 
interpreting the phenotype may be complicated by the presence of the transposon, with its 
associated markers, promoters and other elements. A short insertion, such as TTAA or CTAG, 
which provides a total sequence between the piggyBac ends of TTAATTAATTAA (SEQ ID 
NO. 1) and TTAACTAGTTAA (SEQ ID NO. 2), respectively, allows the insertion to be 
resolved to a TTAA duplication with this four base insertion. TTAATTAATTAA (SEQ ID NO. 
1) and TTAACTAGTTAA (SEQ ID NO. 2), in these examples, provide stop codons in all three 
frames in both directions. An insert of zero base pairs provides a frame shift and a stop codon in 
two frames, although one of these is already present in the original TTAA. 

The present invention may be used to provide a coding region, such as for a fluorescent 
protein or a transactivator protein, such as GAL4, GAL4delta, or tTA. An insertional mutant 
may then be resolved to a fusion protein. As this needs to be in frame and in the correct 
orientation to function, it is generally preferable to also provide a suitable IRES (internal 
ribosome entry site) element and coding region, to allow bicistronic expression, or two such in 
opposite directions to allow bicistronic expression for insertions of either orientation. 
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The present invention may also be used to insert an enhancer or promoter or a suitable 
target site for a site-specific recombinase, either with or without a marker and other sequences. 
Initial experiments with the non-deleted transposon may serve to establish the presence or 
absence of specific enhancer effects on a particular insertion site, and the viability and other 
properties of individuals carrying large insertions at this site. Suitable insertions could then be 
resolved by stepwise excision to the recombinase target site, either with or without a marker and 
other sequences. This could then be used as a docking site to enable the insertion of other DNA 
sequences at this pre-determined and partially pre-characterised position. This arrangement has 
significant benefits in allowing the insertion of heterologous DNA at a predetermined site. If 
desired, the site can be protected from nearby enhancers by flanking the recombinase target site 
with suitable insulator elements, for example scs and scs', or HS4. 

The present invention may also be used to replace an existing transposon insertion with 
specific D NA i n s uch a w ay t hat t he i nsertion i s s ubstantially ore ompletely s table t o further 
exposure to active transposase. This can be performed by replacing the existing transposon with 
a composite transposon of the present invention. Methods for performing such a replacement 
have been described [c./ Johnson-Schlitz, et al. Mol Cell Biol 13, 7006-18 (1993); Cabrera, et 
al t Genesis 34, 62-5 (2002); Sepp, et a!., Genetics 151, 1093-101 (1999); Lankenau, et al t Mol 
Cell Biol 16, 3535-44 (1996); Gonzy-Treboul, et al., Genes Dev 9, 1137-48 (1995); Heslip, et 
al, Genetics 138, 1127-35 (1994); Gloor, etal, Science 253, 1110-7 (1991)]. 

Minimal molecular markers are generally characteristic of one aspect of the invention. It 
will be apparent to the person skilled in the art that PCR-based or other molecular analysis is 
capable of distinguishing each of the various possible forms at each stage. 

However, with respect to both aspects of the invention, it is generally more convenient to 
provide at least one visible or selectable marker, and this is preferred. Suitable markers are well 
known to the person skilled in the art, and include: genes encoding fluorescent proteins, 
including GFP, DsRed and their mutant derivatives; genes encoding drug or antibiotic resistance, 
such as neomycin phosphotransferase, or hygromycin resistance; and markers capable of 
functionally complementing a visible mutant in the host organism, such as mini-w/»Ye + or rosy* 
in Drosophila, white + in Ceratitis capitata or Drosophila cinnabar complementing kw w in Aedes 
aegypti. It is particularly preferred that at least one such marker be associated with the section of 
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DNA containing the DNA of interest, allowing the segment to be tracked through the integration 
and stepwise excision process. 

Where the DNA of interest segment is associated with two repeated ends, then each of 
the two flanking elements may be marked. Double selection may be used to recognise the initial 
insertion and then track the stepwise excisions. Since the last step has no high-probability 
alternatives, individuals with the DNA of interest can be recovered in the final step even though 
they no longer have any associated visible marker. Thus, the present invention provides a novel 
method for recovering transgenics with no selectable marker. 

Visible markers can generally be scored for or against, in other words are generally 
suitable for either positive or negative selection. Automated, or semi-automated systems for 
identifying, or identifying and s eparating individuals are available, and have thepotential for 
screening very large numbers of individuals, for example in the transformation or excision steps. 
For the excision step, negative selection for the appropriate transposon is desirable, together with 
positive selection for the DNA of interest, or an associated marker, as appropriate for the precise 
scheme selected. Negative selection markers are also known that can be used to select against 
particular individuals carrying them; such a system can be conveniently used to screen, or to help 
to screen large numbers of individuals for excision of a specific region. Suitable negative 
selection markers include inducible or repressible lethals, one half of a biphasic expression 
system such as GAL4/UAS or tTA/tRE, if the other half can be provided separately, dominant 
temperature sensitive lethals such as the Drosophila DTSs, or synthetic ones such as a suitable 
toxic element operably linked to a heat-shock or other inducible or repressible promoter. 

Multiple visible markers can be provided not only by using mutant derivatives of 
fluorescent proteins, for example, of GFP and DsRed, which are independently distinguishable 
based on their spectral properties (c.f. Horn et a/., 2002, supra), but alternatively by expressing 
the same or similar markers in different spatial or temporal patterns. For example, it is readily 
possible to distinguish between Drosophila Act5C-DsRed, which shows ubiquitous expression, 
particularly clear in the body of larvae and in the adult eye, and Act88F-DsRed, which shows in 
indirect flight muscles only, and is, therefore, visible in the thorax of late pupae and adults. 
These markers are, therefore, separated by both their spatial and their temporal patterns and 
insects carrying one, the other, or both can readily be distinguished. 
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Where the present invention relates to transposable elements with two repeated ends, 
though the initial transformation reaction and the first excision reaction have several potential 
outcomes, of which not all are desirable, the second excision reaction has only one excision 
product, and this is the desired one. Accordingly, it is possible not to mark one of the flanking 
transposons, to select by suitable methods the desired products of the first two reactions, and 
then to conduct the second excision reaction blind, by exposure to transposase, for example, at 
high concentration, or for several generations of exposure to a jumpstarter element, then identify 
the desired reaction product by molecular methods. It is also possible to perform both excision 
reactions in this way, selecting only for the presence of a marker in the intervening sequence. It 
is, therefore, possible to perform the entire sequence of reactions using only a marker in this 
region. This embodiment provides for particularly short flanking transposons, and thus for the 
maximum size of insert for a given initial composite transposon size. 

It will be appreciated that though each step of the insertion sequence is described 
separately, in practice, exposure to transposase may induce several of these steps to occur within 
one generation, or without the intermediate being specifically identified. This is generally 
acceptable and may provide a faster route to the desired final structure, and may be encouraged 
or stimulated by the use of relatively high concentrations of transposase, or the use of 
hyperactive transposase or cw-acting sequences, if desired. It will be understood that suitable 
use of molecular and/or selectable markers can facilitate this process. 

Transposases are necessary to the function of the present invention, but it is not critical as 
to how they are provided. They may be provided in any suitable manner, as detailed below, and 
may be inherent in the cell, provided on plasmids or even provided within the element itself, 
although this is not preferred. The cell may also be dosed with the enzyme or mRNA encoding 
the enzyme, or even with a virus expressing the enzyme, for example. 

Suitable transposase can be provided in any of several forms: injection or electroporation, 
for example, of a plasmid or RNA encoding the transposase, or of transposase protein itself. A 
transposase source may also be integrated into the target genome, to provide a jumpstarter 1 
construct, or line. This is a preferred method for the excision steps; the jumpstarter element can 
be combined with the initial insertion by conventional breeding, or by making the primary 
transformant in the jumpstarter line. Excision products will then be generated spontaneously, 
without requiring further injection or electroporation. With suitable markers or molecular 



WO 2005/003364 PCT/GB2004/002869 

21 

analysis, the desired products can readily be isolated, and separated from the jumpstarter, if 
required, by conventional breeding. 

Although jumpstarters provided within the transposons of the invention are not generally 
preferred, it is possible to provide a coding sequence therefor in a shorter internal transposon that 
it is desired to excise. Thus, it is possible to generate a jumpstarter line through insertion of an 
autonomous element, followed by its resolution by self-catalysed excision. 

Transposition is thought to work by a number of mechanisms. One such mechanism 
involves "scanning" by the transposase. In this mechanism, the transposase binds to one repeat, 
probably a specific repeat, and scans along the DNA looking for the other repeat. This scanning 
may or may not be directional. However, where more than one repeat can be recognised by the 
transposase, as shown for instance in Figure 2, the nearer of two duplicated repeats will be used 
much more frequently than the other repeat. 

This would not appear to be the case for piggyBac, which is, therefore, thought not to use 
such a scanning mechanism, but probably is the case for other class II transposons. 

Therefore, in order to enhance the rate of transposition of the full length element 
comprising the gene of interest, minimal ends may be used, as described above, or an internal 
rearrangement of the element after insertion can be induced. 



The rearrangement or inversion approach starts with a plasmid with only two functional 
transposable elements, where one transposable element includes the DNA of interest, whereas 
the other uses the "internal" ends and excludes the DNA of interest, see Figure 3A. After 
transposition into the genome and recombinase-induced rearrangement/inversion, the initial 
transposon is converted into a sequence comprising the gene-of-interest and two flanking 
transposable elements, together with a longer transposable element comprising all three, see 
Figure 3B. However, under the scanning mechanism, the two shorter flanking transposable 
elements should then be much more readily mobilised, leaving only the gene-of-interest and one 
(Figure 3C), but preferably neither (Figure 3D), of the flanking elements remaining. 
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Inversion is when a chromosomal segment is excised and reinserted in the same place but 
turned 180 degrees from its original orientation, so that the gene sequence for the segment is 
reversed with respect to that of the rest of the chromosome. 

Suitable systems for such an inversion are well known in the art, such as Cre/lox and 
Flp/Frt recombinase systems. The ere recombinase is a product of lambda phage in Escherichia 
coli, and the FLP recombinase is an enzyme native to the 2 micron plasmid of Saccharomyces 
cerevisiae. These recombinases alter the arrangement of DNA sequences in very specific ways. 
The FLP recombinase, for instance, is active at a particular 34 base pair DNA sequence, termed 
the FRT (FLP recombinase target) sequence. When two of these FRT sites are present, the FLP 
enzyme creates double-stranded breaks in the DNA strands, exchanges the ends of the first FRT 
with those of the second target sequence, and then reattaches the exchanged strands. This process 
leads to inversion or deletion of the DNA which lies between the two sites. Whether there is an 
inversion or deletion depends on the orientation of the FRT sites: if the sites are in the same 
direction, the intervening DNA will be deleted, but if the sites are in opposite orientation, the 
DNA is inverted. 

Recombination using wild-type lox or FRT sites is reversible. However, the use of 
directional recombinase sites, e.g. mutants of lox such as lox66 and lox71, which will recombine 
with each other in an essentially irreversible reaction, is also envisaged, and generally preferred 
as these will prevent re-inversion whereby the inversion cassette is subjected to a second round 
of inversion, such that it is returned to its original orientation. Therefore, the use of directional 
recombinases biases the inversion to give only the desired product. Other recombinases, whose 
wild type targets are non-identical and have an essentially irreversible reaction, are also known, 
e.g. phage C31 integrase acting on attP and attB sites. 

Therefore, it is envisaged that the element comprises two external opposed inverted 
repeats bounding an inversion cassette. The cassette comprises the DNA for insertion into a host 
genome, together with two inverted repeats and two inversion sites. The DNA for insertion into 
a host genome is flanked on either side by one of the inverted repeats, each inverted repeat being 
further flanked by an inversion site, such that reading 5' -3* along one strand of DNA in the 
cassette, there is provided an inversion site such as FRT, a first inverted repeat, the gene or other 
DNA of interest, followed by a second inverted repeat and finally by the second inversion site. 
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The cassette is capable of inversion within the transposed element in situ, in the presence 
of a recombinase. Once the full length transposon has been transposed into the host genome, the 
recombinase induces inversion of the DNA between the inversion sites. 

The result of this is that that following inversion, the two inverted repeats, flanking the 
DNA of interest, each separately form a new pair of opposing inverted repeats. They do so with 
one of the external inverted repeats, thus forming a new pair of opposing repeats that are being 
excisable by a transposase in situ, thus leaving the DNA of interest without flanking transposon- 
derived repeats in the host genome. 

The inversion sites, are preferably suitably orientated with respect to each other to allow 
inversion rather than deletion of the DNA therebetween. For instance, with the Flp/FRT system 
commonly known to in the art and used here as a non-limiting example only, are preferably 
aligned in opposite orientations to allow inversion rather than deletion. 

The inversion sites are preferably suitably orientated with respect to each other to allow 
inversion rather than deletion of the DNA therebetween, 

As with the transposase the recombinase may, preferably, be encoded by the transposable 
element of the present invention or, even more preferably, separately therefrom. 

Clearly, this recombinase-based inversion approach requires an additional step. The 
efficiency of this step could be increased by using mutant recombinase target sequences, which 
make the reaction essentially irreversible. A good marker system would also be useful to detect 
such rearrangements and is provided in accordance with the invention. Here, for instance, the 
recombinase target sites are embedded in two different markers, such that following 
rearrangement, two new markers are created. 

For example, marker 1 might be EGFP under the control of an eye-specific promoter; 
marker 2 m ight b e D sRed under the c ontrol o f a ubiquitous orb ody-specific p remoter. T he 
recombinase target site is included in each marker between the promoter and the coding region 
for the fluorescent protein. The initial transgenic then exhibits green fluorescent eyes and red 
fluorescent body, while individuals carrying the desired recombination product (which is an 
inversion of an internal section of the composite transposon) show red fluorescent eyes and a 
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green fluorescent body. Individuals carrying the desired inversion should therefore be readily 
detected, even if rare. 

Accordingly, the present invention also provides a method of creating a new marker by 
such a rearrangement, such as the red eyes produced in the above example. Furthermore, the 
present invention also extends to a method of removing a conditional lethal. 

Without being bound by theory, it is worth noting that the 31 base pair spacer found in 
the right hand (3*) piggyBac repeat may wrap around a protein, in contrast to the 3 or 4 base pair 
spacer in the 5' repeat, perhaps suggesting that the transposon is an evolutionary ancestor of the 
VDJ antibody variation domain of the MHC, where a similar mechanism is also seen. 
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Experiment 1: Use Of Minimal Ends 

We made construct pLA1025 (see Figure 5 and SEQ ID NO. 3), which includes minimal 
piggyBac ends (35bp and 63bp bases for 5 f and 3' respectively). We generated transgenic 
Drosophila melanogaster carrying this construct by co-injection with a helper plasmid into a 
white-eyed strain (Handler and James 2000; Handler 2002, below). Potential transgenics were 
screened for fluorescence characteristic of EGFP and DsRed. pLA1025 contains 4 potential 
transposons, marked with [Actin5C-EGFP, Actin5C-DsRed2 and Actin88F-DsRed2], [Actin5C- 
EGFP and Actin88F-DsRed2 but not Actin5C-DsRed2], [Actin5C-EGFP only] and [Actin88F- 
DsRed2 only], respectively. 

If all transposon ends were utilised with equal efficiency, one might expect these 4 transposons 
to be recovered with equal frequency, or possibly a higher frequency for the shorter transposons 
which contain only one marker. Of 5 independent transgenic lines recovered from 85 GO adult 
injection survivors, all showed all three markers, and therefore correspond to the desired 
composite t ransposon. This i ndicates t hat t he u tilisation o f e nds a nd n ature o f t he recovered 
transposons can be biased in a desirable way by using internal ends that are less readily utilised 
for plasmid-to-germline transposition. 

A diagrammatic representation of the pLA1025 construct is shown in Figure 5 and its predicted 
sequence given in SEQ ID NO. 3. 



Experiment 2: Resolution Of Four-End Constructs To Give Transgenic 
Strains In Which DNA Of Interest Is No Longer Flanked By Transposon 

DNA. 

We made construct pLA1125 which includes 4 piggyBac ends, two each of 5' and 3' ends. 
pLAl 125 is shown diagrammatically in figure 6 and its predicted sequence is shown in (SEQ ID 
NO. 4) 



WO 2005/003364 



26 



PCT/GB2004/002869 



We generated transgenic Drosophila melanogaster carrying this construct by co-injection with a 
helper plasmid into a white-eyed strain (Handler and James 2000; Handler 2002, below). 
Potential transgenics were screened for fluorescence characteristic of DsRed. We recovered 
transgenics in the progeny of 12 out of 227 adult GO injection survivors. PCR analysis for the 
presence of each of regions 1, 2 and 3, and their contiguous presence, showed that all these 
transgenic lines contained a composite transposon of the expected structure. 

This demonstrated that, although pLA1125 contains 4 potential transposons, transgenics 
containing the desired composite transposon (PB5 , -ZsGreen-PB3 , -DsRed2-PB5 -AmCyan-PB3') 
could be recovered at a useful rate. We found that neither ZsGreen nor AmCyan could be 
reliably scored by fluorescence microscopy in these transgenic flies, despite the presence of the 
coding regions for these proteins as analysed by PCR, so subsequent experiments used 
fluorescence microscopy as a reliable determinant of the presence of DsRed only. Parallel 
experiments with other constructs indicated that the OpJE2 promoter fragment used to drive 
expression of ZsGreen and AmCyan in LAI 125 gives only weak expression in Drosophila, 
furthermore the AmCyan region of LAI 125 contains a frameshift likely to reduce or eliminate 
production of functional AmCyan protein. 

One of these transgenic LAI 125 lines (linel2) was selected for further study. We determined 
the insertion site by obtaining flanking DNA and sequencing, then comparing this sequence with 
the known genomic DNA sequence of Drosophila melanogaster from the Drosophila genome 
project. This showed that the composite LAI 125 element had inserted into the sequence located 
on chromosome 2. This is shown in Figure 7 where the site of insertion is written in capital 
letters and underlined, and in (SEQ ID NO. 5). 

We exposed the integrated LAI 125 line 12 (henceforth "1125-12") to piggyBac transposase by 
crossing to piggyBac "jumpstarter" lines, either pCasper-hs-orf (see http://piggvbac.bio.nd.eduA . 
or pHer{3xP3-ECFP, hsp70-piggyBac} (Horn, N et al. 2003). Progeny of these crosses were 
heatshocked during embryonic and larval development (37°C for 1 hr, 2x per week, starting 0-7 
days after egg deposition, at which point parents were removed, stopping when first pupae were 
observed in vial). Double heterozygous (1125-12/+ and jumpstarter/+) Fl adults were selected, 
based on marker expression. 
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Somatic transposition 

Extracts from pools of 5 double heterozygous (1125-12/+ and jumpstarter/+) adults, or from 
individual flies, were analysed for excision of the flanking simple transposons by PCR, see 
Tables 1 a nd 2 b elow. Precise excision o f e ither e lement w ould b ring t he flanking genomic 
DNA closer to the central DsRed gene. This would lead to a characteristic, predictable, 
reduction in size of a PCR product using Dro-12m-4 + su-a5c-rl primers or Dro-12m-l + Diag- 
droso primers for PCR, respectively detecting excision of the ZsGreen or AmCyan simple 
transposon. 



Primer's name 


Predicted size of PCR 
product from 1125-12 


Predicted size of PCR product 
from 1125-12 following deletion 
of ZsGreen simple transposon 


Dro-12m-4 + su-a5c-rl 


3058 bp 


277 bp 



Table 1 



Primers 


Predicted size of PCR 
product from 1125-12 


Predicted size of PCR product 
from 1125-12 following deletion 
of AmCyan simple transposon 


Dro-12m-l + Diag-droso 


2913 bp 


268 bp 



Table 2 



In all pools and all individuals analysed, we observed the characteristic bands expected for 
precise e xcision o f e ach s imple t ransposon ( Figure 8). Wedidnoto bserve s uch b ands from 
similar extracts of flies from the 1 125-12 line which had not been exposed to transposase. We 
therefore conclude that it is possible to eliminate each of the two flanking transposons from the 
composite 1125 transposon, while retaining the central region. We further conclude that this 
excision is substantially dependent on the presence of piggyBac transposase, as expected. Since 
the PCR band is of the expected size for a precise excision, and piggyBac excisions are known to 
be precise in the large majority of cases, we conclude that precise somatic excision of the 
flanking elements is occurring at a readily detectable rate. 
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Figure 8 shows the PCR detection of somatic excision of the flanking transposons in Fl adults 

flies after exposure to transposase, wherein: 

M - DNA marker (SmartLadder, Eurogentec) 

A = PCR for the excision of ZsGreen simple transposon 

B - PCR for the excision of AmCyan simple transposon 

Al and B 1 - template DNA from a pool of 5 Fl flies which were exposed to transposase 
A2 and B2 = template DNA from a single Fl fly which was exposed to transposase 
A3 and B3 = template DNA from a single fly which had no exposure to transposase 

Germ-line transposition 

In order to determine whether the flanking transposons could be eliminated in the germ-line, to 
generate gametes containing the excision products and hence a heritable excision product, double 
heterozygous (1125-12/+ and jumpstarter/+) Fl adults were crossed to white-eyed (w IU8 ) flies. 
Pools of 7 F2 progeny flies, selected for the presence of the DsRed marker by fluorescence 
microscopy, were obtained. E xtracts from these pools were analysed f or t he p resence o f the 
specific excision products as above. Several pools yielded specific PCR bands as for the somatic 
excision assay above. One example is shown in Figure 9. This indicates that germline excision 
can occur as for somatic excision above. 

Figure 9 shows the PCR detection of germline excision of ZsGreen simple transposon, wherein: 
M = DNA marker (SmartLadder, Eurogentec) 

1 - 7 = DNA extracted from different pools of 7 F2 1 125-12 flies (with DsRed) 
Dro-12m-4 and su-a5c-rl primers were used in these PCR reactions 

1 92 i ndividual flies w ere a nalysed f or e xcision by u sing t he p rimers w hich h ad b een u sed t o 
detect the somatic and the germline excision of ZsGreen and Amcyan simple transposons. These 
flies were also analyised for the presence of ZsGreen (su-zsgreen-f + su-zsgreen-r) and Amcyan 
(su-amcyan-f + su-amcyan-r) simple transposons and for connections between the central DsRed 
region with ZsGreen (Diag-rab-b-g + su-a5c-rl) and with AmCyan (Opie2-nhe-R + Diag-droso) 
simple transposons. 

The somatic excision data, and the data from pools of F2 individuals, clearly indicates that 
precise excision of the flanking transposons can occur. However, in a careful PCR-based 
analysis of four of the individual F2 flies showing excision events, we were unable to amplify a 
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diagnostic flanking DNA band of the type used to show precise excision in somatic cells. 
However, these flies had clearly lost one or both of their flanking transposons, as judged by the 
inability to amplify unique regions from one or both of these transposons (primers su-zsgreen-f 
and su-zsgreen-r for the ZsGreen transposon and su-amcyan-f and su-amcyan-r for the AmCyan 
transposon), while being readily able to amplify a unique region from the central DsRed region 
(primers su-red-f and su-red-r) and indeed to observe fluorescence characteristic of DsRed in 
these flies. 



We conclude that the excision event that removed the flanking simple transposons was 
associated with a rearrangement, possibly a deletion of flanking genomic DNA but, more likely, 
a transposition of the composite transposon such that it is no longer associated with the original 
flanking DNA but is now inserted at a new site in the genome. The ability to recover 
remobilised, products of this type is an advantage, as it indicates that an initial primary 
transformant can be simultaneously relocated to another site and stabilised by elimination of one 
or both flanking transposons, which saves a considerable amount of time over the multi- 
generational breeding scheme that would be required to obtain such a product if each step had to 
be performed sequentially. 

We further analysed the two DsRed-positive flies in which neither ZsGreen nor AmCyan 
appeared to be present. PCR using primers PB5-sh-srf-asc-l and PB5-sh-not-hind-2, which 
amplify a section of piggyBac, showed that no such piggyBac DNA is present anywhere in these 
flies (Figure 10). The ZsGreen and AmCyan transposons have not, therefore, rearranged or 
transposed to another part of the genomes of these flies, rather they have been completely lost, 
presumably by excision in the germ-line of the Fl parent. We have therefore demonstrated that 
it is possible by this method to generate transgenic flies in which the inserted DNA is not 
associated with the ends of a transposon. 

Figure 10 shows the results of PCR analysis of piggyBac sequence in DsRed stable 

transformants, wherein: 

M = DNA marker (SmartLadder, Eurogentec) 

1 - DNA from LAI 125-12 flies unexposed to transoposase used as positive control 

2 = DNA from fly 70 (has neither ZsGreen nor AmCyan) 

3 = DNA from fly 200 (has neither ZsGreen nor AmCyan) 
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Markers 

The ZsGreen and AmCyan open reading frames were used only as molecular markers in these 
experiments. Use of visible markers would reduce the amount of PCR required, as phenotypic 
scoring for the presence of such markers, e.g. for the production of fluorescent proteins, could be 
used to infer excision and other events. However, we have clearly demonstrated that this is not 
necessary, that these events occur at sufficient frequency that molecular-based screening alone is 
sufficient to recover the desired types, e.g. excision products. Two variants of LAI 125 are 
therefore proposed based on the above data: (i) an equivalent construct with visible markers in 
the flanking transposons and (ii) an equivalent construct with shorter flanking transposons 
containing no such visible markers. The second of these options should give higher transposition 
and excision frequencies, as shorter transposons are generally thought to be more active. A short 
segment of unique sequence in each of the flanking transposons would be helpful for the 
molecular analysis. Such a construct is also the product of a Flp/Frt, Cre/lox or other similar 
inversion system following recombinase-mediated inversion of the central section of the 
construct described. 

Vector 

pLA1125 provides several unique sequences into which additional DNA could be inserted. 
Particularly convenient in this regard are the Ascl and Srfl sites in the central section. pLAl 125, 
and the variants described above, therefore provide vectors of general utility for the method of 
the invention. 

Primers 

Primers are given below in Table 3 



Primer Name 


Sequence 


Dro-12m-l 


gccagtcctgcagcttcagcgtgtg (SEQ ID NO. 6) 


Dro-12m-4 


aaaggtatgatggttgaagtatccgc (SEQ ID NO. 7) 


Diag-droso 


ccttctttgttgaatccagatcctgc (SEQ ID NO. 8) 


Diag-Ra-b-g 


gctcctgggcaacgtgctggttg (SEQ ID NO. 9) 


Opie2-nhe-R 


ggtggctagcttgcgcttcttcttgggtgggg (SEQ ID NO. 10) 
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PB5-sh-srf-asc-l 


ggtgggcgcgcccaattgcccgggctttttaaccctagaaagatagtctgcgt 
(SEQIDNO. 11) 


PB5-sh-not-hind-2 


gcccaagcttgcggccgcgtcattttgactcacgcggtcg (SEQ ID 
NO. 12) 


su-a5c-rl 


gatttgattcagagttgatgccattcatg (SEQ ID NO. 13) 


su-amcyan-f 


ccagacctccaccttcaaggtgacc (SEQ ID NO. 14) 


su-amcyan-r 


ttgtaggaggtgtggaactggcatctg (SEQ ID NO. 1 5) 


su-red-f 


caacaccgtgaagctgaaggtgacc (SEQ ID NO. 16) 


su-red-r 


cttggccatgtagatggacttgaactcc (SEQ ID NO. 17) 


su-zsgreen-f 


caagcaggccatcaacctgtgc (SEQ ID NO. 18) 


su-zsgreen-r 


gacttggccttgtacacggtgtcg (SEQ ID NO. 19) 
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